Tree-based reinforcement learning for optimal water reservoir operation
نویسندگان
چکیده
منابع مشابه
Operation Scheduling of MGs Based on Deep Reinforcement Learning Algorithm
: In this paper, the operation scheduling of Microgrids (MGs), including Distributed Energy Resources (DERs) and Energy Storage Systems (ESSs), is proposed using a Deep Reinforcement Learning (DRL) based approach. Due to the dynamic characteristic of the problem, it firstly is formulated as a Markov Decision Process (MDP). Next, Deep Deterministic Policy Gradient (DDPG) algorithm is presented t...
متن کاملTree Based Hierarchical Reinforcement Learning
In this thesis we investigate methods for speeding up automatic control algorithms. Specifically, we provide new abstraction techniques for Reinforcement Learning and Semi-Markov Decision Processes (SMDPs). We introduce the use of policies as temporally abstract actions. This is different from previous definitions of temporally abstract actions as we do not have termination criteria. We provide...
متن کاملDeep Reinforcement Learning based Optimal Control of Hot Water Systems
Energy consumption for hot water production is a major draw in high efficiency buildings. Optimizing this has typically been approached from a thermodynamics perspective, decoupled from occupant influence. Furthermore, optimization usually presupposes existence of a detailed dynamics model for the hot water system. These assumptions lead to suboptimal energy efficiency in the real world. In thi...
متن کاملHbmo in Optimal Reservoir Operation
The broad of applicability, ease of use, and global perspective of so-called meta-heuristic algorithms may be considered as the primary reason for their extensive application and success as search and optimization tools in various problem domains. Honey bees are among the most well-studied social insects. Their mating process may also be considered as a typical swarm-based approach to optimizat...
متن کاملTree-Based On-Line Reinforcement Learning
Fitted Q-iteration (FQI) stands out among reinforcement learning algorithms for its flexibility and ease of use. FQI can be combined with any regression method, and this choice determines the algorithm’s statistical and computational properties. The combination of FQI with an ensemble of regression trees gives rise to an algorithm, FQIT, that is computationally efficient, scalable to high dimen...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Water Resources Research
سال: 2010
ISSN: 0043-1397
DOI: 10.1029/2009wr008898